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SEQUENCE LISTING 
(1) GENERAL INFORMATION: 
(i) APPLICANT: Merck & Co., Inc. 

(ii) TITLE OF INVENTION: CLONING AND IDENTIFICATION 

OF THE MOTILIN RECEPTOR 

(iii) NUMBER OF SEQUENCES: 15 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Merck & Co . , Inc. 

(B) STREET: P.O. Box 2000, 126 E . Lincoln Ave. 

(C) CITY: Rahway 

(D) STATE: NJ 

(E) COUNTRY: USA 

(F) ZIP: 07065-0900 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: Windows 

(D) SOFTWARE: FastSEQ for Windows Version 2.0b 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

( C ) CLASSIFICATION : 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/089,098 

(B) FILING DATE: 12-JUN-1998 



(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Giesser, Joanne M 

(B) REGISTRATION NUMBER: 32,838 

(C) REFERENCE /DOCKET NUMBER: 2 0251 PCT 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 732-594-3046 

(B) TELEFAX: 732-594-4720 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3066 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE :' Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

TTGAAATTAT CTGGTCACTG CCGGGCGCGG TGGCTCACGC CTGTAATCCC AGC AC TTTGG 6 0 

GAGGTCGAGG CGGGTGGACC ACCTGGGGTC AGGAGTTCGA GACCAGGCTG GCCAACATGG 12 0 

CGAAACCCTG AC T AC AC AAA AAACACAAAA TTTAGCCGGG GCTTGGGCGC TCCTGTGCTC 180 

CCAGCTACTC AGGAGGCTGA GGTGGGAGGA CTGCTTGAGC CTGGGAGGTC GAGGCTGCAG 24 0 

TGAGCTGTGA TCGCGCCACT TAAACTCCAG CCTGGACGAC AGTGAGACCC TGTCTCAAGA 3 00 

AGAAAAAAAG AAAGAAAGAA AGAAAAAAAG AAAAAAAAGA AATTATTTGG TCAATTATAT 3 60 

GGTCAGCTCC CTCCACCACT CGCGAATTTA CAGAAGAGGA GAACTGGGCT GGGCGAGACC 42 0 

AGGACTAGCC CAAGATTACA CAAGTTACTC GGTTGTAGAG CCAGGATTAG ACAGGAGAGG 4 80 

CTCTAGATTC TGGTCTAGAC TCCCCTCCTA TTATTTAGCA TTATGGCTTC CTGAGGATTA 54 0 

CCATGAGCCC TCCTCCACCG TCAAGCGGCA GCTACCAGCC ACCAGACCAG ATCCCTTCGA 6 00 

AGGTGCCCGG AGTACCAGAC TGAC AAAAGC GCCCGTACAG TGCTCAGTCC TGTAACCAAA 66 0 

GCTGTCTAGG GTGCAGACAT CGCTCACCGG ACCGGGTAGG GCTCGTGCGC TAAGGGCGCC 72 0 

GGGTATTCCA GTTAGTGGAG AGGGAAGCGC CCTGGAACTG CATGGGCCCG GGAGAGGGCG 780 

CGGGAGCGGA GCATGGCCGG GCCGGGGCGG- GCCGCGGCCG TGGGCGGAGA CTGCGCGCAG 84 0 

CTAGCTCGGG AGCGCCTCGG AGCCCACCCC GCAGAGCCGC TTCTCGCGCC CCGCAGCGCA 900 

GCGCAGCGCT CCGCCGTCTG ACCTGCCGCG CCCGCAGCGT GCGGGCTGGG AAAGGAGGCG 96 0 

CTCACCGAGA GGGACCACGC GCCAGGCTCC CAGCCCGACC CGGGACGCGG CGGCCGCGCG 102 0 

GAGCACCCAT GGGCAGCCCC TGGAACGGCA GCGACGGCCC CGAGGGGGCG CGGGAGCCGC 108 0 

CGTGGCCCGC GCTGCCGCCT TGCGACGAGC GCCGCTGCTC GCCCTTTCCC CTGGGGGCGC 114 0 

TGGTGCCGGT GACCGCTGTG TGCCTGTGCC TGTTCGTCGT CGGGGTGAGC GGCAACGTGG 12 00 

TGACCGTGAT GCTGATCGGG CGCTACCGGG ACATGCGGAC CACCACCAAC TTGTACCTGG 12 6 0 

GCAGCATGGC CGTGTCCGAC CTACTCATCC TGCTCGGGCT GCCGTTCGAC CTGTACCGCC 132 0 

TCTGGCGCTC GCGGCCCTGG GTGTTCGGGC CGCTGCTCTG CCGCCTGTCC CTCTACGTGG 13 8 0 

GCGAGGGCTG CACCTACGCC ACGCTGCTGC ACATGACCGC GCTCAGCGTC G AGC GC TAC C 144 0 

TGGCCATCTG CCGCCCGCTC CGCGCCCGCG TCTTGGTCAC CCGGCGCCGC GTCCGCGCGC 1500 

TCATCGCTGT GCTCTGGGCC GTGGCGCTGC TCTCTGCCGG TCCCTTCTTG TTCCTGGTGG 156 0 

GCGTCGAGCA GGACCCCGGC ATCTCCGTAG TCCCGGGCCT CAATGGCACC GCGCGGATCG 162 0 

CCTCCTCGCC TCTCGCCTCG TCGCCGCCTC TCTGGCTCTC GCGGGCGCCA CCGCCGTCCC 16 8 0 

CGCCGTCGGG GCCCGAGACC GCGGAGGCCG CGGCGCTGTT CAGCCGCGAA TGCCGGCCGA 174 0 

GCCCCGCGCA GCTGGGCGCG C TGCGTGTC A TGCTGTGGGT CACCACCGCC TACTTCTTCC 18 00 

TGCCCTTTCT GTGCCTCAGC ATCCTCTACG GGCTCATCGG GCGGGAGCTG TGGAGCAGCC 1860 

GGCGGCCGCT GCGAGGCCCG GCCGCCTCGG GGCGGGAGAG AGGCCACCGG CAGACCGTCC 192 0 

GCGTCCTGCG TAAGTGGAGC CGCCGTGGTT CCAAAGACGC CTGCCTGCAG TCCGCCCCGC 19 80 

CGGGGACCGC GCAAACGCTG GGTCCCCTTC CCCTGCTCGC CCAGCTCTGG GCGCCGCTTC 2 04 0 

CAGCTCCCTC C TATTTC GAT TCCAGCCTCC ACCCGCCGGT ACTTCCCATC CCCCGAGAAA 2100 

ACCATGTCCT GTCCCCCAGG AGCTCTGGGG GACCCCAGGG CGCTTTGAGG GTGGGATCCC 2160 

CGGATCCGAT TCAGTAACCA GCAGTGCTTT TCCAGAGCCT CTGAGACCAG AAAGGAGAGT 222 0 

TGGTAATTCT TAATCCAACC ACCTGTTAGA TGCCACAAAT GAGGAGTCCT CACAGTGCTC 228 0 

TTGAGAAGAC GAGGGAGATT TCATTAAGCT AAAATTTTTT ATTTAATGTT AAGTGATGC T 234 0 

GAAGGCTAAA GTAAACCTTG CTCGTATCAA AAAGTAAAGA TTGTGCAGAC CTGTTGTAGA 240 0 

ATTCTTTTCA ACAGAGAACA GAAAACTTGT CTCCGAAGTG GGTTTGTGGA AGGAAGCCTG 246 0 

CCAAGGCGGC TTGTTCAGAG AAATTGCTCC TTCTGGTTTA TGTCCAGCCT TGATAACACA 252 0 

TATGGGAGCC TAC TATGC AG TTTTAAAGCA AGTATCCATG CAGCCTGCAG CCTGGTCATT 2580 

TTTTCTGGGG TGAGGATCTG CCTAGGTAGA AGTTTTCTCT AATTTATTTT GCTGTTACTT 2 64 0 

GTTATTGCAG ATGGTTCCTT GTCGGGGTGG GGGGTTTATT TGCTTCCCAA TGC TTTTGTT 2700 

AATCCCGGTG CTGTGTCTTA TGTTGCAGTG GTGGTGGTTC TGGCATTTAT AATTTGCTGG 276 0 

TTGCCCTTCC ACGTTGGCAG AATCATTTAC ATAAAC AC GG AAGATTCGCG GATGATGTAC 2 82 0 

TTCTCTCAGT ACTTTAACAT CGTCGCTCTG CAACTTTTCT ATCTGAGCGC ATCTATCAAC 288 0 

CCAATCCTCT ACAACCTCAT TTCAAAGAAG TAC AGAGC GG CGGCCTTTAA ACTGCTGCTC 294 0 

GCAAGGAAGT CCAGGCCGAG AGGCTTCCAC AGAAGC AG GG ACACTGCGGG GGAAGTTGCA 3 00 0 

GGGGACACTG GAGGAGACAC GGTGGGCTAC ACCGAGACAA GCGCTAACGT GAAGACGATG 3 060 

GGATAA 3 06 6 
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(2) INFORMATION FOR SEQ ID NO : 2 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1239 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



ATGGGCAGCC CCTGGAACGG CAGCGACGGC CCCGAGGGGG CGCGGGAGCC GCCGTGGCCC 60 

GCGCTGCCGC CTTGCGACGA GCGCCGCTGC TCGCCCTTTC CCCTGGGGGC GCTGGTGCCG 120 

GTGACCGCTG TGTGCCTGTG CCTGTTCGTC GTCGGGGTGA GCGGCAACGT GGTGACCGTG 180 

ATGCTGATCG GGCGCTACCG GGACATGCGG ACCACCACCA ACTTGTACCT GGGCAGCATG 240 

GCCGTGTCCG ACCTACTCAT CCTGCTCGGG CTGCCGTTCG ACCTGTACCG CCTCTGGCGC 3 00 

TCGCGGCCCT GGGTGTTCGG GCCGCTGCTC TGCCGCCTGT CCCTCTACGT GGGCGAGGGC 3 60 

TGCACCTACG CCACGCTGCT GCACATGACC GCGCTCAGCG TCGAGCGCTA CCTGGCCATC 420 

TGCCGCCCGC TCCGCGCCCG CGTCTTGGTC ACCCGGCGCC GCGTCCGCGC GCTCATCGCT 480 

GTGCTCTGGG CCGTGGCGCT GCTCTCTGCC GGTCCCTTCT TGTTCCTGGT GGGCGTCGAG 54 0 

CAGGACCCCG GCATCTCCGT AGTCCCGGGC CTCAATGGCA CCGCGCGGAT CGCCTCCTCG 600 

CCTCTCGCCT CGTCGCCGCC TCTCTGGCTC TCGCGGGCGC CACCGCCGTC CCCGCCGTCG 660 

GGGCCCGAGA CCGCGGAGGC CGCGGCGCTG TTCAGCCGCG AATGCCGGCC GAGCCCCGCG 720 

CAGCTGGGCG CGCTGCGTGT CATGCTGTGG GTCACCACCG CCTACTTCTT CCTGCCCTTT 7 80 

CTGTGCCTCA GCATCCTCTA CGGGCTCATC GGGCGGGAGC TGTGGAGCAG CCGGCGGCCG 84 0 

CTGCGAGGCC CGGCCGCCTC GGGGCGGGAG AGAGGCCACC GGCAGACCGT CCGCGTCCTG 900 

CTGGTGGTGG TTCTGGCATT TATAATTTGC TGGTTGCCCT TCCACGTTGG CAGAATCATT 960 

TACATAAACA CGGAAGATTC GCGGATGATG TACTTCTCTC AGTACTTTAA CATCGTCGCT 102 0 

CTGCAACTTT TCTATCTGAG CGCATCTATC AACCCAATCC TCTACAACCT CATTTCAAAG 1080 

AAGTACAGAG CGGCGGCCTT TAAACTGCTG CTCGCAAGGA AGTCCAGGCC GAGAGGCTTC 1140 

CACAGAAGCA GGGACACTGC GGGGGAAGTT GCAGGGGACA C TGGAGGAG A CACGGTGGGC 120 0 

TACACCGAGA CAAGCGCTAA C GTGAAGAC G ATGGGATAA 1239 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1390 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
ATGGGCAGCC CCTGGAACGG CAGCGACGGC CCCGAGGGGG CGCGGGAGCC GCCGTGGCCC 60 
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GCGCTGCCGC CTTGCGACGA GCGCCGCTGC TCGCCCTTTC CCCTGGGGGC GCTGGTGCCG 120 

GTGACCGCTG TGTGCCTGTG CCTGTTCGTC GTCGGGGTGA GCGGCAACGT GGTGACCGTG 180 

ATGCTGATCG GGCGCTACCG GGACATGCGG ACCACCACCA ACTTGTACCT GGGCAGCATG 24 0 

GCCGTGTCCG ACCTACTCAT CCTGCTCGGG CTGCCGTTCG ACCTGTACCG CCTCTGGCGC 3 00 

TCGCGGCCCT GGGTGTTCGG GCCGCTGCTC TGCCGCCTGT CCCTCTACGT GGGCGAGGGC 3 60 

TGCACCTACG CCACGCTGCT GCACATGACC GCGCTCAGCG TCGAGCGCTA CCTGGCCATC 42 0 

TGCCGCCCGC TCCGCGCCCG CGTCTTGGTC ACCCGGCGCC GCGTCCGCGC GCTCATCGCT 480 

GTGCTCTGGG CCGTGGCGCT GCTCTCTGCC GGTCCCTTCT TGTTCCTGGT GGGCGTCGAG 540 

CAGGACCCCG GCATCTCCGT AGTCCCGGGC CTCAATGGCA CCGCGCGGAT CGCCTCCTCG 600 

CCTCTCGCCT CGTCGCCGCC TCTCTGGCTC TCGCGGGCGC CACCGCCGTC CCCGCCGTCG 660 

GGGCCCGAGA CCGCGGAGGC CGCGGCGCTG TTCAGCCGCG AATGCCGGCC GAGCCCCGCG 72 0 

CAGCTGGGCG CGCTGCGTGT CATGCTGTGG GTCACCACCG CCTACTTCTT CCTGCCCTTT 780 

CTGTGCCTCA GCATCCTCTA CGGGCTCATC GGGCGGGAGC TGTGGAGCAG CCGGCGGCCG 840 

CTGCGAGGCC CGGCCGCCTC GGGGCGGGAG AGAGGCCACC GGCAGACCGT CCGCGTCCTG 900 

CGTAAGTGGA GCCGCCGTGG TTCCAAAGAC GCCTGCCTGC AGTCCGCCCC GCCGGGGACC 960 

GCGCAAACGC TGGGTCCCCT TCCCCTGCTC GCCCAGCTCT GGGCGCCGCT TCCAGCTCCC 1020 

TTTCCTATTT CGATTCCAGC CTCCACCCGC CGTGGTGGTG GTTCTGGCAT TTATAATTTG 1080 

CTGGTTGCCC TTCCACGTTG GCAGAATCAT TTACATAAAC ACGGAAGATT CGCGGATGAT 114 0 

GTACTTCTCT CAGTACTTTA ACATCGTCGC TCTGCAACTT TTCTATCTGA GCGCATCTAT 12 00 

CAACCCAATC CTCTACAACC TCATTTCAAA GAAGTACAGA GCGGCGGCCT TTAAACTGCT 12 6 0 

GCTCGCAAGG AAGTCCAGGC CGAGAGGCTT CCACAGAAGC AGGGACACTG CGGGGGAAGT 1320 

TGCAGGGGAC ACTGGAGGAG ACACGGTGGG CTACACCGAG ACAAGCGCTA ACGTGAAGAC 13 8 0 

GATGGGATAA 13 90 



(2) INFORMATION FOR SEQ ID NO: 5: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 386 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
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Ser 


Gly 


He 


Tyr 


Asn 


Leu 


Leu 


Val 


Ala 


Leu 


Pro 


Arg 


Trp 


Gin 






355 










360 










365 




Asn 


His 


Leu 


His 


Lys 


His 


Gly 


Arg 


Phe 


Ala 


Asp 


Asp 


Val 


Leu 


Leu 


Ser 



370 375 380 



Val Leu 
385 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1092 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

ATGCCCTGGA CCAGACCCCA GGTGGACCTC CATGCTGCTG CAGCAGAGAC CATGGACCAG 6 0 

TACACCACGG ACGACCACCA CTACGAGGGC TCCCTCTTCC CCGCGTCCAC CCTCATCCCC 12 0 

GTCACGGTCA TCTGCATCCT CATCTTCGTG GTCGGCGTGA CCGGCAACAC C ATG AC CATC 18 0 

CTCATCATCC AGTACTTCAA GGACATGAAG ACCACCACCA ACCTGTACCT GTCCAGCATG 240 

GCCGTGTCCG ACCTCGTCAT CTTCCTCTGC CTGCCCTTCG ACCTGTACCG CCTGTGGAAG 300 

TACGTGCCGT GGCTGTTCGG CGAGGCCGTG TGCCGCCTCT ACCACTACAT CTTCGAAGGC 360 

TGCACGTCGG CCACCATCCT CCACATCACG GCCCTGAGCA TCGAGCGCTA CCTGGCCATC 42 0 

AGCTTCCCCC TCAGGAGCAA GGTGATGGTG ACCAGGAGAA GGGTCCAGTA CATCATCCTG 480 

GCCCTGTGGT GCTTCGCCCT GGTGTCGGCC GCTCCCACGC TCTTCCTGGT CGGGGTGGAG 540 

TACGACAACG AGACGCACCC CGACTACAAC ACGGGCCAGT GCAAGCACAC GGGCTACGCC 600 

ATCAGCTCGG GGCAGCTGCA CATCATGATC TGGGTGTCCA CCACCTACTT CTTCTGCCCG 660 

ATGCTGTGTC TCCTCTTCCT CTACGGCTCC ATCGGGTGCA AGCTGTGGAA GAGCAAGAAC 720 

GACCTGCAGG GCCCGTGCGC CCTGGCCCGC GAGAGGTCGC ACAGGCAAAC GGTGAAGATC 780 
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CTGGTGGTGG TGGTGCTGGC CTTCATCATC TGCTGGCTGC CCTACCACAT CGGCAGGAAC 840 

CTGTTCGCCC AGGTGGACGA CTACGACACG GCCATGCTCA GCCAGAATTT CAACATGGCC 900 

TCCATGGTGC TCTGCTACCT CAGCGCCTCC ATCAACCCCG TCGTCTACAA CCTGATGTCG 96 0 

AGGAAGTACC GGGCCGCCGC CAAGCGCCTC TTCCTGCTCC ACCAGAGACC CAAGCCGGCC 102 0 

CACCGGGGGC AGGGGCAGTT TTGCATGATC GGCCACAGCC CCACCCTGGA CGAGAGCCTG 108 0 

AC GGGGGTGT GA 10 92 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 



Met 


Pro 


Trp 


Thr 


Arg 


Pro 


Gin 


Val 


Asp 


Leu 


His 


Ala 


Ala 


Ala 


Ala 


Glu 


1 








5 










10 










15 




Thr 


Met 


Asp 


Gin 


Tyr 


Thr 


Thr 


Asp 


Asp 


His 


His 


Tyr 


Glu 


Gly 


Ser 


Leu 








20 










25 










30 






Phe 


Pro 


Ala 


Ser 


Thr 


Leu 


He 


Pro 


Val 


Thr 


Val 


He 


Cys 


He 


Leu 


He 






35 










40 










45 








Phe 


Val 


Val 


Gly 


Val 


Thr 


Gly 


Asn 


Thr 


Met 


Thr 


He 


Leu 


lie 


He 


Gin 




50 










55 










60 










Tyr 


Phe 


Lys 


Asp 


Met 


Lys 


Thr 


Thr 


Thr 


Asn 


Leu 


Tyr 


Leu 


Ser 


Ser 


Met 


65 










70 










75 










80 


Ala 


Val 


Ser 


Asp 


Leu 


Val 


He 


Phe 


Leu 


Cys 


Leu 


Pro 


Phe 


Asp 


Leu 


Tyr 










85 










90 










95 




Arg 


Leu 


Trp 


Lys 


Tyr 


Val 


Pro 


Trp 


Leu 


Phe 


Gly 


Glu 


Ala 


Val 


Cys 


Arg 








100 










105 










110 






Leu 


Tyr 


His 


Tyr 


lie 


Phe 


Glu 


Gly 


Cys 


Thr 


Ser 


Ala 


Thr 


He 


Leu 


His 






115 










120 










125 








lie 


Thr 


Ala 


Leu 


Ser 


He 


Glu 


Arg 


Tyr 


Leu 


Ala 


He 


Ser 


Phe 


Pro 


Leu 




130 










135 










140 










Arg 


Ser 


Lys 


Val 


Met 


Val 


Thr 


Arg 


Arg 


Arg 


Val 


Gin 


Tyr 


He 


He 


Leu 


145 










150 










155 










160 


Ala 


Leu 


Trp 


Cys 


Phe 


Ala 


Leu 


Val 


Ser 


Ala 


Ala 


Pro 


Thr 


Leu 


Phe 


Leu 










165 










170 










175 




Val 


Gly 


Val 


Glu 


Tyr 


Asp 


Asn 


Glu 


Thr 


His 


Pro 


Asp 


Tyr 


Asn 


Thr 


Gly 








180 










185 










190 






Gin 


Cys 


Lys 


His 


Thr 


Gly 


Tyr 


Ala 


He 


Ser 


Ser 


Gly 


Gin 


Leu 


His 


He 






195 










200 










205 








Met 


lie 


Trp 


Val 


Ser 


Thr 


Thr 


Tyr 


Phe 


Phe 


Cys 


Pro 


Met 


Leu 


Cys 


Leu 




210 










215 










220 










Leu 


Phe 


Leu 


Tyr 


Gly 


Ser 


He 


Gly 


Cys 


Lys 


Leu 


Trp 


Lys 


Ser 


Lys 


Asn 


225 










230 










235 










240 


Asp 


Leu 


Gin 


Gly 


Pro 


Cys 


Ala 


Leu 


Ala 


Arg 


Glu 


Arg 


Ser 


His 


Arg 


Gin 










245 










250 










255 




Thr 


Val 


Lys 


lie 


Leu 


Val 


Val 


Val 


Val 


Leu 


Ala 


Phe 


He 


He 


Cys 


Trp 








260 










265 










270 






Leu 


Pro 


Tyr 


His 


lie 


Gly 


Arg 


Asn 


Leu 


Phe 


Ala 


Gin 


Val 


Asp 


Asp 


Tyr 






275 










280 










285 









WO 99/64436 



-7- 



0 



^ **• *» a«J IxZSi -*Lll : *-*iL 

WO 99/64436 PCTYUS99/1 2773 



Asp 


Thr 
290 


Ala 


Met 


Leu 


Ser 


Gin 
295 


Asn 


Phe 


Asn 


Met 


Ala 
300 


Ser 


Met 


Val 


Leu 


Cys 


Tyr 


Leu 


Ser 


Ala 


Ser 


He 


Asn 


Pro 


Val 


Val 


Tyr 


Asn 


Leu 


Met 


Ser 


305 










310 










315 










320 


Arg 


Lys 


Tyr 


Arg 


Ala 


Ala 


Ala 


Lys 


Arg 


Leu 


Phe 


Leu 


Leu 


His 


Gin 


Arg 










325 










330 










335 


Pro 


Lys 


Pro 


Ala 
340 


His 


Arg 


Gly 


Gin 


Gly 
345 


Gin 


Phe 


Cys 


Met 


He 
350 


Gly 


His 


Ser 


Pro 


Thr 


Leu 


Asp 


Glu 


Ser 


Leu 


Thr 


Gly 


Val 













355 360 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
CCATCCTAAT ACGACTCACT ATAGGGC 27 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
TTATCCCATC GTCTTCACGT TAGCGCTTGT CTC 3 3 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CTGCCCTTTC TGTGCCTCAG CATCCTCTAC 30 
(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

ATGGGCAGCC CCTGGAACGG CAGCGACGGC CCCGAGGGGG CGCGGGAGCC GCCGTGGCCC 60 

GCGCTGCCGC CTTGC GACG A GCGCCGCTGC TCGCCCTTTC CCCTGGGGGC GCTGGTGCCG 12 0 

GTGACCGCTG TGTGCCTGTG CCTGTTCGTC GTCGGGGTGA GCGGCAACGT GGTGACCGTG 180 

ATGCTGATCG GGCGCTACCG GGACATGCGG ACCACCACCA AC TTGT AC C T GGGCAGCATG 240 

GCCGTGTCCG ACCTACTCAT CCTGCTCGGG CTGCCGTTCG ACCTGTACCG CCTCTGGCGC 300 

TCGCGGCCCT GGGTGTTCGG GCCGCTGCTC TGCCGCCTGT CCCTCTACGT GGGCGAGGGC 3 60 

TGCACCTACG CCACGCTGCT GCACATGACC GCGCTCAGCG TCGAGCGCTA CCTGGCCATC 42 0 

TGCCGCCCGC TCCGCGCCCG CGTCTTGGTC ACCCGGCGCC GCGTCCGCGC GCTCATCGCT 480 

GTGCTCTGGG CCGTGGCGCT GCTCTCTGCC GGTCCCTTCT TGTTCCTGGT GGGCGTCGAG 540 

CAGGACCCCG GCATCTCCGT AGTCCCGGGC CTCAATGGCA CCGCGCGGAT CGCCTCCTCG 600 

CCTCTCGCCT CGTCGCCGCC TCTCTGGCTC TCGCGGGCGC CACCGCCGTC CCCGCCGTCG 660 

GGGCCCGAGA CCGCGGAGGC CGCGGCGCTG TTCAGCCGCG AATGCCGGCC GAGCCCCGCG 72 0 

CAGCTGGGCG CGCTGCGTGT C ATGC TGTGG GTCACCACCG CCTACTTCTT CCTGCCCTTT 78 0 

CTGTGCCTCA GCATCCTCTA CGGGCTCATC GGGCGGGAGC TGTGGAGCAG CCGGCGGCCG 840 

CTGCGAGGCC CGGCCGCCTC GGGGCGGGAG AGAGGCC AC C GGCAGACCGT CCGCGTCCTG 900 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



Met 


Gly 


Ser 


Pro 


Trp 


Asn 


Gly 


Ser 


Asp 


Gly 


Pro 


Glu 


Gly 


Ala 


Arg 


Glu 


1 








5 










10 










15 




Pro 


Pro 


Trp 


Pro 


Ala 


Leu 


Pro 


Pro 


Cys 


Asp 


Glu 


Arg 


Arg 


Cys 


Ser 


Pro 








20 










25 










30 






Phe 


Pro 


Leu 


Gly 


Ala 


Leu 


Val 


Pro 


Val 


Thr 


Ala 


Val 


Cys 


Leu 


Cys 


Leu 






35 










40 










45 








Phe 


Val 


Val 


Gly 


Val 


Ser 


Gly 


Asn 


Val 


Val 


Thr 


Val 


Met 


Leu 


He 


Gly 




50 










55 










60 










Arg 


Tyr 


Arg 


Asp 


Met 


Arg 


Thr 


Thr 


Thr 


Asn 


Leu 


Tyr 


Leu 


Gly 


Ser 


Met 


65 










70 










75 










80 


Ala 


Val 


Ser 


Asp 


Leu 


Leu 


He 


Leu 


Leu 


Gly 


Leu 


Pro 


Phe 


Asp 


Leu 


Tyr 










85 










90 










95 




Arg 


Leu 


Trp 


Arg 


Ser 


Arg 


Pro 


Trp 


Val 


Phe 


Gly 


Pro 


Leu 


Leu 


Cys 


Arg 








100 










105 










110 






Leu 


Ser 


Leu 


Tyr 


Val 


Gly 


Glu 


Gly 


Cys 


Thr 


Tyr 


Ala 


Thr 


Leu 


Leu 


His 






115 










120 










125 








Met 


Thr 


Ala 


Leu 


Ser 


Val 


Glu 


Arg 


Tyr 


Leu 


Ala 


He 


Cys 


Arg 


Pro 


Leu 



130 135 140 
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Arg 


Ala 


Arg 


vai 


Leu 


Val 


Thr 


Arg 


Arg 


Arg 


val 


Arg 


TV "1 _ 

Ala 


Leu 


lie 


Ala 


145 










1 D U 










Id 0 










1 0 U 


Val 


Leu 


Trp 


Ala 


Val 


Ala 


Leu 


Leu 


C n -v- 

ber 


Aia 


Cjiy 


Pro 


Phe 


Leu 


Phe 


Leu 










ICC 










1 / u 










1 / D 




Val 


Gly 


val 




(jin 


Asp 


Pro 


L»iy 


lie 


Ser 


vai 


vai 


Pro 


Lily 


Leu 


Asn 








TOO 

1 oU 










lo J 










ion 

1 y u 






Gly 


Thr 


Ala 


Arg 


lie 


Ala 


ber 


Cor 


Pro 


Leu 


7\ n =1 
A±a 


Ser 


C £1 -K- 

ber 


Pro 


Pro 


Leu 






i y D 










0 n n 
z u u 










one 








Trp 


Leu 


Ser 


Arg 


Ala 


Pro 


Pro 


Pro 


Ser 


Pro 


Pro 


Ser 


1 ^ r 

biy 


Pro 


bill 


Thr 




Tin 




















0 0 n 

ZzU 










Ala 


Glu 


Ala 


Ala 


Ala 


Leu 


Phe 


Ser 


Arg 


CjIU 


Cys 


Arg 


Pro 


Ser 


Pro 


Ala 


225 










230 










235 










240 


Gin 


Leu 


Gly 


Ala 


Leu 


Arg 


Val 


Met 


Leu 


Trp 


Val 


Thr 


Thr 


Ala 


Tyr 


Phe 










245 










250 










255 




Phe 


Leu 


Pro 


Phe 


Leu 


Cys 


Leu 


Ser 


He 


Leu 


Tyr 


Gly 


Leu 


He 


Gly 


Arg 








260 










265 










270 






Glu 


Leu 


Trp 


Ser 


Ser 


Arg 


Arg 


Pro 


Leu 


Arg 


Gly 


Pro 


Ala 


Ala 


Ser 


Gly 






275 










280 










285 








Arg 


Glu 


Arg 


Gly 


His 


Arg 


Gin 


Thr 


Val 


Arg 


Val 


Leu 












290 










295 










300 











(2) INFORMATION FOR SEQ ID NO: 13: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CGTAAGTGGA GCCGCCGTGG TTCCAAAGAC GCCTGCCTGC AGTCCGCCCC GCCGGGGACC 60 
GCGCAAACGC TGGGTCCCCT TCCCCTGCTC GCCCAGCTCT GGGCGCCGCT TCCAGCTCCC 12 0 

TTTCCTATTT CGATTCCAGC CTCCACCCGC CGGT 154 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 602 base pairs 

(B) TYPE: nucleic acid 

(C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

AGCTGGTGGT GGTTCTGGCA TTTATAATTT GCTGGTTGCC CTTCCACGTT GGCAGAATCA 60 

TTTACATAAA C AC GGAAGAT TCGCGGATGA TGTACTTCTC TCAGTACTTT AACATCGTCG 120 

CTCTGCAACT TTTCTATCTG AGCGCATCTA TCAACCCAAT CCTCTACAAC CTCATTTCAA 180 

AGAAGTACAG AGCGGCGGCC TTTAAACTGC TGCTCGCAAG GAAGTCCAGG CCGAGAGGCT 240 

TCCACAGAAG CAGGGACACT GCGGGGGAAG TTGCAGGGGA CACTGGAGGA GACACGGTGG 3 00 

GCTACACCGA GACAAGCGCT AACGTGAAGA C GATGGG AT A ACGTAAGTGG AGCCGCCGTG 3 60 

GTTCCAAAGA CGCCTGCCTG CAGTCCGCCC CGCCGGGGAC CGCGCAAACG CTGGGTCCCC 42 0 



-10- 



WO 99/64436 



PCT/US99/12773 



TTCCCCTGCT CGCCCAGCTC TGGGCGCCGC TTCCAGCTCC CTTTCCTATT TCGATTCCAG 4 80 

CCTCCACCCG CCGTGGTGGT GGTTCTGGCA TTTATAATTT GCTGGTTGCC CTTCCACGTT 540 

GGCAGAATCA TTTACATAAA C AC GGAAGAT TCGCGGATGA TGTACTTCTC TCAGTACTTT 600 

AA 602 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



Leu Val Val Val 
1 

Gly Arg lie lie 
20 

Ser Gin Tyr Phe 
35 

Ser lie Asn Pro 
50 

Ala Ala Phe Lys 
65 

His Arg Ser Arg 

Asp Thr Val Gly 
100 

Arg Lys Trp Ser 
115 

Pro Pro Gly Thr 
130 

Leu Trp Ala Pro 
145 

Thr Arg Arg Gly 

Pro Arg Trp Gin 
180 

Val Leu Leu Ser 
195 



Leu Ala Phe lie 

5 

Tyr lie Asn Thr 

Asn lie Val Ala 
40 

lie Leu Tyr Asn 
55 

Leu Leu Leu Ala 
70 

Asp Thr Ala Gly 
85 

Tyr Thr Glu Thr 

Arg Arg Gly Ser 
120 

Ala Gin Thr Leu 
135 

Leu Pro Ala Pro 
150 

Gly Gly Ser Gly 
165 

Asn His Leu His 
Val Leu 



lie Cys Trp Leu 
10 

Glu Asp Ser Arg 
25 

Leu Gin Leu Phe 

Leu lie Ser Lys 
60 

Arg Lys Ser Arg 
75 

Glu Val Ala Gly 
90 

Ser Ala Asn Val 
105 

Lys Asp Ala Cys 

Gly Pro Leu Pro 
140 

Phe Pro lie Ser 
155 

lie Tyr Asn Leu 
170 

Lys His Gly Arg 
185 



Pro Phe His Val 
15 

Met Met Tyr Phe 
30 

Tyr Leu Ser Ala 
45 

Lys Tyr Arg Ala 

Pro Arg Gly Phe 
80 

Asp Thr Gly Gly 
95 

Lys Thr Met Gly 
110 

Leu Gin Ser Ala 
125 

Leu Leu Ala Gin 

lie Pro Ala Ser 
160 

Leu Val Ala Leu 
175 

Phe Ala Asp Asp 
190 
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